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Abstract 

The time to flowering and maturity are ecologically and agronomically important traits for soybean landrace and cultivar 
adaptation. As a typical short-day crop, long day conditions in the high-latitude regions require soybean cultivars with 
photoperiod insensitivity that can mature before frost. Although the molecular basis of four major £ loci {£/ to £4) have 
been deciphered, it is not quite clear whether, or to what degree, genetic variation and the expression level of the four £ 
genes are associated with the time to flowering and maturity of soybean cultivars. In this study, we genotyped 180 cultivars 
at £/ to £4 genes, meanwhile, the time to flowering and maturity of those cultivars were investigated at six geographic 
locations in China from 201 1 to 2012 and further confirmed in 2013. The percentages of recessive alleles at £/, £2, E3 and £4 
loci were 38.34%, 84.45%, 36.33%, and 7.20%, respectively. Statistical analysis showed that allelic variations at each of four 
loci had a significant effect on flowering time as well as maturity. We classified the 180 cultivars into eight genotypic groups 
based on allelic variations of the four major £ loci. The genetic group of el-nf representing dysfunctional alleles at the £/ 
locus flowered earliest in all the geographic locations. In contrast, cultivars in the E1E2E3E4 group originated from the 
southern areas flowered very late or did not flower before frost at high latitude locations. The transcriptional abundance of 
functional £7 gene was significantly associated with flowering time. However, the ranges of time to flowering and maturity 
were quite large within some genotypic groups, implying the presence of some other unknown genetic factors that are 
involved in control of flowering time or maturity. Known genes (e.g. E3 and £4) and other unknown factors may function, at 
least partially, through regulation of the expression of the £/ gene. 
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Introduction 

It is generally considered that soybean has been domesticated in 
China for several thousand years [1]. During the domestication 
and breeding processes, the time to flowering and maturity 
underwent natural and human selections since these traits affect 
geographic distribution, sowing time and final yield of soybean 
cultivars. Latitudinal distribution for a given cultivar is typically 
restricted to a limited north south zone for the maximal yield. 
Since soybean is a short-day crop, soybean cultivar acquired a 
certain photoperiodic insensitivity to adapt to higher latitude 
regions where there is a longer daylength in the summer and a 
shorter frost-free period a year. A maturity group (MG 000, MG 



00, MG 0, MG I ~ MG X) system has been well developed to 
estimate the range of adaptability to latitudinal or geographic 
zones of soybean cultivars in the USA and Canada [2,3]. MG 000, 
MG 00, and MG 0 are the earliest maturity cultivars and are 
mainly distributed in production areas of the southern Canada [4], 
while MG I and MG II are typically grown in the northern region 
of the USA, the rest of the groups are succeedingly grown further 
south [5]. Although this system has been practically used for 
several decades in soybean breeding and soybean production, 
assignment of a cultivar into a maturity group is time-consuming, 
and is sometimes hard due to the ambiguous phenotypic data [5] . 

As early as 1920s, researchers began to study the genetic factors 
regulating flowering time [6] . El to E8 loci, known as the E series. 
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have been geneticaUy identified (El and E2 [7], E3 [8], E4 [9], E5 
[10], E6 [11], E7 [4], and E8 [12]). Additionally, the J locus for 
long juvenile was also genetically detected [13]. The i? series of loci 
also underlie the maturity or the duration of the reproductive 
phase (DRP) in soybean [14]. Relationship between maturity 
groups and genotypes of the E loci was inferred by genetic study 
using Harosoy or Clark near isogeneic lines (NILs) [5] . 

Recently, the availability of soybean genomic information has 
accelerated positional cloning in soybean [15]. To date, the 
molecular bases for El to E4 loci have been uncovered [16-19], 
although other loci e.g. E5 to E8 loci still remain unknown. Genetic 
analysis indicated the E3 gene is partially dominant over E4 [20], 
and both genes respond to the quality of light [20,21]. Recent 
progress made out that the E3 and E4 genes encode two 
phytochromes, GmPhyAS and GmPhyA2 [16-17]. In Arabidopsis, 
the dKLlXrEA (GI) gene plays an important role in the GI-CO- 
FT photoperiodic flowering pathway and also can direcdy regulate 
the _FTgene [22,23]. Positional cloning revealed that OmGIa, an 
ortholog of G/ gene is responsible for the E2 locus in soybean [18]. 
The E2 locus is reported to have no strong association with 
photoperiodic response, and functional mechanism of the E2 gene 
in soybean is stiU needed to be elucidated further [18]. 

Generally, the El locus has major impact on flowering and 
maturity. The notice of its effect on flowering could trace back as 
early as 1920s when photoperiodism was discovered [24]. 
Although the El was named in 1971 [7], the cloning of this gene 
was rather difficult since this locus is located in the pericentromeric 
region [25]. Xia and co-researchers successfully disclosed the 
molecular identity for the El locus through nearly 1 0 years of fine 
mapping and functional confirmation [19]. The El gene has a 
bipartite nuclear localization signal and a domain distantly related 
to the AP2 domain. Functional analysis showed nonfunctional El 
alleles displayed an early flowering time phenotype, regardless of 
the genetic background at other E loci or daylength condition 
[19]. The strongly suppressed expression in short day condition 
also infers that this gene is strongly related to photoperiodic 
response as a flowering repressor, which is consistent with the 
results obtained in previous genetic studies [19]. In addition, the El 
locus might have pleiotropic effects on other important agronomic 
traits [26], e.g. branching [27] and chilling tolerance [28]. 

Recently, Tsubokura et al. genotyped 63 accessions at the El, 
E2, E3 and E4 genes using DNA markers, and also sequenced the 
promoter and coding regions of those genes to detect the genetic 
variation at single nucleotide level for 39 accessions [29]. The 
result showed that these allelic variations could explain about 62 to 
66 ""(i of the phenotypic variation of flowering time among the 63 
plant accessions used [29]. Xu et al (2013) studied 53 photoperiod 
insensitive soybean accessions including some cultivars from 
HeUongjiang, the most northern province in China, and classifled 
them into 6 genotypic groups using genotype data of El to E4 
[30] . However, the general information on genotypic variations at 
the four known E loci among cultivars from different soybean 
growing areas or geographic regions with different range of 
photoperiod sensitivity in China is stiU lacking. Therefore, the aims 
of this study were to investigate the extent to which the genetic 
variation at El through E4, as well as expression of the El gene 
could explain the phenotypic variation of the time to flowering and 
maturity for cultivars collected from different soybean growing 
areas in China. The results obtained in this study will be useful for 
identification and cloning of new genes involved in flowering time 
or maturity, as well as for marker assisted selection in soybean 
breeding. 



Materials and Methods 

Soybean Cultivars and Accessions 

A total of 180 cultivars were mainly obtained from the Gene 
Resource Center ofjilin Academy of Agricultural Sciences, China. 
The origin and other traits for these cultivars are listed in Table SI 
in detail. 

Genotyplng 

A standard CTAB DNA extraction protocol was followed [31]. 
The gc-notyping of all cultivars at the El, E2, E3 and E4 genes was 
performed using known DNA markers (Table S2). Electrophoresis 
was conducted by either agarose gel or high-efficiency genome 
scanning (HEGS) with non-denaturing 11-13% polyacrylamide 
separating gels and 5% stacking gels [32,33]. The gels were stained 
with GelStain (Transgen, Beijing, China), and visualized with Gel 
Doc XR Molecular Imager System (Bio-Rad, USA). 

(1) Genotyping of the El Gene. Genomic DNA was 
amplified with primer pair of TI-Fw and TI-Rv [19]. The PGR 
was performed using the following program: 30 cycles at 94°C for 
20 s, 58°C for 30 s, and 72°C for 30 s. A 443/444 bp fragment 
(Figure 1 A) was amplified and subjected to Taql or Hinfl digestion. 
The banding patterns were used to distinguish among El, el-as, 
and el-fs [19]. The PGR product amplified from el-as allele was 
cut into two fragments, 410 bp and 34 bp by Taql (arrows in 
Figure IB,) while that amplified from El or el-fs allele remained 
uncut. For discrimination between El and el-fi alleles, the 
fragment amplified from el-fs allele could be cut into four 
fragments (234,117,47 and 34 bp), while only three fragments 
were present for the El allele. Besides, the el-nl allele lacking the 
443 bp band was indicated by triangles in Figure lA to IC. 

(2) Genotyping of the E2 Gene. The 142 bp fragment was 
amplified with primer pair SoyGI_dCAP_Dra_fw and SoyGI_d- 
CAP_Dra_rv [18,29] and subjected to Dral digestion. The 
fragment amplified from E2 allele remained uncut, while that 
from e2 allele could be cut into 115 bp and 27 bp fragments 
(Figure ID). 

(3) Genotyping of the E3 Gene. Genomic DNA was 
amplified with a primer set consisting of four primers, 
E3_08557FW'', E3_09908RV, E3Ha_1000RV and 
E3T_0716RV [29]. A 1406 bp fragment could be specificaUy 
amplified from E3-Mi allele, while a 269 bp fragment could be 
specificaUy amplified from e3-t allele. A 557 bp fragment could be 
amplified from either E3-Ha or e3-Mo alleles (Triangle, Figure IE). 
For further discrimination between E3-Ha and e3-Mo, the PGR 
product was subsequentiy amplified with a primer pair of 
E3_08094FW and E3_08417RV, and subjixted to MsA digestion 
(Figure IF). The fragment amplffied from the E3-Ha allele 
remained uncut; while that from e3-Mo allele could be cut into 
223 bp and 101 bp fragments. For specifically identification ot e3- 
fs allele, the fragment (758 bp) was amplified with primer pair of 
e3-fsFV\'' and e3-fsRV and subjected to Alel digestion. The 
fragment amplified from the e3-fs allele remained uncut, while 
that amplified from the E3-Ha was cut into 552 bp and 206 bp 
fragments (arrows in Figure IG). 

(4) Genotyping of the E4 Gene. Using a set of primers, 
PhyA2-For, PhyA2-Rev/E4 and PhyA2-Rev/E4 [29], an 837 bp 
fragment could be specifically amplified from the e4-S0REl allele, 
while a 1,229 bp fragment could be generated from other types of 
E4 alleles (Figure IH). For discrimination between e4-kam, e4-kes 
and the E4 alleles, the PGR product was amplified with primer 
pair of e4-kamF\V and e4-kamRV, and was digested with AfRl or 
BspHl. The fragment from e4-kam allele could be cut into 286 and 
208 bp fragments hy AfM (Figure II). Also the fragments amplified 
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Figure 1. Genotyping methods for £7 to f4loci. A-C: Genotyping of the El gene. Fragment was amplified by primer pair of Tl-Fw and Tl-Rv (A), 
subsequently subjected to Taq\ digestion (B) and H/Vifl (C). The triangle represents the el-nl type (lacking of fragment). The arrows in B and in C are 
representing el-as and el-fs genotypes, respectively. D: Genotyping of the E2 gene, fragment was amplified with primer pair of SoyGI_dCAP_Dra_fw 
and SoyGI_dCAP_Dra_rv, after digestion of Dral, the e2 was cut into two fragments, while E2 genotype remained uncut (arrows). E-G: Genotyping of 
the E3 gene. E: bands with three sizes were generated using the mixed primers. Band specific for E3-Mi and ei-fr were indicated by arrow and star, 
respectively. A 557 bp fragment (triangle) could be amplified either for E3-Ha or e3-Mo genotypes. F: PGR product yielded from primer pair 
£3_08094FW and f3_0841 7RV were digested with Msel. The E3-Ha genotype remained uncut, while e3-Mo allele (arrow) could be cut into 223 bp and 
101 bp. G: specific determination of £3-/5 type (arrow). CAPE primer pair f3-fsFW/f3-fsRV, restriction enzyme: Ale\. H-J: Genotyping of the E4 gene. H: 
Using three mixed primers, a 837 fragment was specifically amplified from the E4 allele (arrow). I: CAPE primer pair e4-kam specific for e4-kam gene 
(arrow), enzyme AflW. J: CAPE primer pair e4-kes specific for e4-kes genotype (arrow), enzyme BspHI. 
doi:1 0.1 371/journal.pone.0097636.g001 



from e4-kes allele could be cut into 399 and 95 bp by BspHl 
(Figure IJ). While the fragment from the E4 allele remained uncut 
by either AJRl or BspHl. 

Phenotypic Observation 

Rl to R8, the reproductive stages of .soybean, were defined 
according to Fehr's system [34]. Rl refers the beginning of bloom 
when the opening of the first flower was found at any node on the 
main stem. R7 stands for the beginning of maturity when one 
normal pod on the main stem has reached its mature pod color 
(normally brown or tan); R8 refers full maturity when 95 percent 
of the pods have reached their mature pod color. For a given 
cultivar, each specific R stage is defined only when at least 50"/o of 
individual plants reached that stage. At least 15 plants for each 
cultivar per geographic location were grown for the phenotypic 
evaluation. For convenience, the data for Rl, R7 or R8 of a given 
cultivar presented in a format of means ± standard derivation 
(s.d.) in this study were recorded as the number of days after 
emergence to the stage of Rl, R7 and R8, respectively. 

Six geographic locations were chosen for evaluating photope- 
riodic responses. Three of them were located within Northern 
latitudes of 43° to 46°N, the critical regions for photoperiod 
response since most cultivars can mature during the frost free 
period and display contrasting dififerences in time to flowering. 
The other three different latitudinal locations were located 



successively in southern area (Figure 2A). Six locations are: 1 
Harbin (HRB): Research field at the Campus of Northeast 
Institute of Geography and Agroecology, Harbin, Heilongjiang 
(45°70' N, 126°64' E); 2. Mudanjiang (MDJ): Mudanjiang 
Research Station, Heilongjiang Academy of Agricultural Science 
(44°42'N, 129°52'E); 3. Gongzhuling (GZL): Gongzhuling 
Research Station, Jilin Academy of Agricultural Science, Gongz- 
huling, Jilin (43°53' N, 124°84' E); 4. Jinan (JN): Campus of 
Shandong Normal University, Jinan, Shandong (36°66' N, 
11 7° 17' E); 5. Huaian (HA): Huaiyin Research Station, Jiangsu 
Academy of Agricultural Science, Huaian, Jiangsu (33°57' N, 
119°04' E); 6. Nanjing (NJ): Luhe Research Station, Jiangsu 
Academy of Agricultural Science, Nanjing, Jiangsu (32°31' N, 
1 18°82' E) (Figure 2A). We did not include locations further north 
than Harbin (45°70' N, 126°64' E), considering the most cultivars 
collected from the South might not reach flowering (Rl) before 
frost. Also we did not include any location further south than 
Nanjing (32°3r N, 118°82' E), since flowering time or maturity 
time in the short-day condition become shorter and similar for the 
majority of cultivars collected. Based on the first two years' (2011 
and 2012) results (Table S3), around 50 representative cultivars 
mainly showing contrasting Rl or R7 or R8 within the same 
genetic group (see "Results" section) were selected for further 
phenotypic confirmation in 2013. 
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Figure 2. Geographic locations, daylengtli, and temperature of six experimental sites. A: The geographic locations of the six experimental 
sites. B: the average day length (hr) between 201 1 and 201 2. C: The changes in temperature recorded in 201 1 . Since there was no temperature data 
available in Gongzhuling (43 '53' N, 124'84'E), we used the data from the neighboring city Changchun (43 88' N, 125 35' E) (60 Km apart) instead. 
doi:1 0.1 371 /journal.pone.0097636.g002 



Quantitative Real-time PCR 

Quantitative real-time PCR (qRT-PCR) analysis was performed 
for plant materials taken from the location of Harbin on May 20, 
2012 when the day length was 16.10 hr. The cultivars were 
performed for qRT-PCR at this location were star (*) marked in 
the accession column in Table SI. The upmost fuUy expanded 
leaves from the apical meristem were sampled from 2.5 to 3 hr 
after dawn, 14 days after emergence. Total RNA was extracted 
using TRIzol (Life Technologies) method. The isolated RNA was 
then subjected to reverse transcription using the Superscript III 
Reverse Transcriptase kit. Quantitative real-time PCR was 
performed on each cDNA sample with the SYBR Green Master 
Mix (TransStart Top Green qPCR SuperMix) on Bio-Rad 
Chromo4 Detection System according to the manufacturer's 
protocol. The measured Ct values were converted to relative copy- 
numbers using the AACt method. Amplification of TUA5 
{Glyme05g29000.1) was used as an internal control to normalize 
all data. El expression level of Kariyutaka was used as a reference. 



Primers used were TUA5-¥ ,5'-TGCCACCATCAAGACTAA- 
GAGG and TUA5-K 5'-CTCTAATGGCGGCATCAAG; El-¥ 
5'-CACTCAAATTAAGCCCTTTCA and El-K 5'- 
TTCATCTCCTCTTCATTTTTGTTG; Three fuUy indepen- 
dent biological rephcates were obtained and subjected to real-time 
PCR run in triplicate. Raw data were standardized as described 
previously [35]. 

Weather Data Collection and Statistical Analysis 

The temperature data were downloaded from the National 
Meteorological Information Center (http://cdc.cma.gov.cn. Ac- 
cessed 2014, March 30). The daylength data were calculated at 
time.ac.cn/calendar/calendar.htm for the six sites where we 
performed the phenotypic observation. In order to statistically 
evaluate the effects of allelic variation for each E locus and their 
combinations on flowering time and maturity, multivariate 
analysis was performed using IBM SPSS Statistics 17.0, www- 
01.ibm.com/software/analytics/spss, based on generalized linear 



PLOS ONE I www.plosone.org 



4 



May 2014 | Volume 9 | Issue 5 | e97636 



Genotypes and Expression of E Genes in Soybean 




c 

TO 
CO 



O 
CM 



Day 

100- 
90- 

80 
70- 
60- 
50- 
40- 
30- 



20 



20 



B 





— r- 
40 



— r- 
60 



— r- 
80 



100 120 

R1, 2011, Mudanjiang 



140 Day 



Figure 3. The correlation analyses of the time to flowering (R1) between 2011 and 2012. A: IHuaian; B: Mudanjiang. 
dol:1 0.1 371/journal.pone.0097636.g003 



models. The Type III Sum of Squares was used to test eflects 
between subjects. All other statistical analysis, e.g. comparison 
between different genetic groups, was performed using GraphPad 
Prism Version 5.0 for Windows, GraphPad Software (San Diego 
California USA, www.graphpad.com). 

Results 

1. Correlation of Time to Flowering and Maturity among 
Different Geographic Locations 

After careful selection, 180 cultivars were used in this study, 
most of which were elite germplasm lines or cultivars extensively 



used in China, previously or currently. Also, cultivars from the 
USA, Japan, and European were included for comparison. Several 
cultivars carrying a specific genotype, e.g. Sakamotowase for el-fi, 
Kariyutaka for Ele3e4 were also included as control for 
monitoring the performance in genotyping and phenotypic 
evaluation. The phenotypic evaluation for the time to flowering 
and to maturity, as well as the other agronomic traits was 
conducted in the six locations from 20 11 to 2012 (Figure 2, Table 
SI). 

Daylength during growth season at different sites was variable as 
shown in Figure 2. Considering the frost-free period and 
temperature, we generally sowed seeds as the local farmers did 
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from late April to the middle of May for the northern sites: 
Harbin, Mudanjiang and Gongzhuling, or in June for the three 
southern sites, Jinan, Huaian, and Nanjing. The number of days 
from sowing to emergence was depending on temperature and soU 
moisture. For individual plant, Rl was recorded as the number of 
days from emergence to first flowering open. At the population 
level of a cultivar, Rl was recorded when more than 50% 
percentage of individual plants reached the Rl stage. Some data 
for Rl or R8 were missing mainly because cultivars did not reach 
the R 1 stage before frost or died from diseases during germination. 
The cultivars originally from southern regions, e.g. nearby the 
Yangtze River, flowered late in the three northern sites, Harbin, 
Gongzhuling and Mudanjiang. 

Generally, Rl is a relatively stable ecological character for a 
cultivar as indicated by higher correlation coefficients between 
different sites. At the southern sites of Huaian and Nanjing, the Rl 
of 2011 and 2012 were significantly correlated with correlation 
coefficient of 0.977** (hereinafter ** stands P<0.01) and 0.938** 
(Table 1, Figure 3). However, for the other sites, the correlation 
coefficients ranged from 0.790** to 0.923** (Table 1, Figure 3), the 
ffuctuations in correlation coefficients were possibly due to the 
influence of fluctuated environmental factors e.g. soil moisture, 
temperature from sowing to emergence between different years or 
between different sites. 

The number of days after emergence to the beginning of 
maturity (R7) and to fuUy maturity (R8) were measured in 20 1 1 
and 2012, statistical analysis showed a high correlation for R7 
between 201 1 and 2012 in Nanjing with a correlation coefficients 
of 0.941**. Generally, the correlation coefficient for R7 or R8 
between different sites was lower than that for Rl (Table 2, 
Figure 3). Correlation coefficients between Rl and R3 were higher 
than that between Rl and R7 or R8 for all the sites (Figure 4). 
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2. Allelic Variations at Four Major Alleles and Their Effects 
on Flowering Time and Maturity 

We genotyped all the 180 cultivars (accessions) at the El, E2, E3 
and E4 genes in this study. Four allelic variations, El, el -as, el-fs, 
el-nl, were identified at the El locus [19] (Figure lA to C). 
According to multivariate analysis (Generalized Linear Models, 
Type III Sum of Squares in IBM SPSS), statistical significance 
level (P) for the effect of the El allelic variation was within a range 
between 0.003 and 0.207 with an average of 0.0785 on flowering 
time (Rl, Table S4) and a range between 0.00 and 0.435 with an 
average of 0.092 on maturity (R7 or R8, Table S5) in all 
geographic locations in 2011 and 2012. 

Only two allelic variations, E2 and e2 were observed in this 
study (Figure ID, [18]). The effect of E2 aUehc variations is 
reffected by significance levels (P) ranging from 0.005 to 0.657 with 
an average of 0.233 on ffowering time (Rl, Table S4), from 0.01 1 
to 0.903 with an average of 0.449 on maturity (R7 or R8, Table 
S5). 

Five allelic variations including three recessive alleles, e3-fi, e3-tr, 
e3-A'Io, and two dominant alleles {E3-Ha, E3'Mi) were identified at 
the E3 locus (Figure IE to IG) [17,29,36]. The statistical P values 
for the effects oiE3 allelic variations on flowering time were within 
a range from 0.02 to 0.422 with an average of 0.201 in 2011 and 
2012 (Table S4). Also, effect oiE3 allehc variation on maturity (R7 
or R8) was within a range from P = 0.02 to 0.964 with an average 
of 0.361 (Table S5). 

For the E4 allehc variations, E4, e4-S0REl, e4'Kes, and e4-kam 
were found among the 180 cultivars, the latter three alleles were 
recessive (Figure IH to IJ) [16,29,30]. The significance levels (P 
value) for the effect of E4 allelic variation were ranged from 
P = 0.029 to 0.869 with an average of 0.342 on flowering time 
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Figure 4. The correlation between R1 and R3, R7 or R8 at Nanjing, Huaian, GongzKiuling and IMudanjiang locations, average of the 
two years of 2011 and 2012. A: Correlation between R1 and R3 in IVIudanjiang; B: Correlation between R1 and R7 in IVIudanjiang. C: correlation 
between R1 and R3 in Gongzhuling; D: Correlation between R1 and R8 in Gongzhuling. E: Correlation between Rl and R3 in Huaian; F: Correlation 
between Rl and R8 in Huaian. G: Correlation between Rl and R3 in Nanjing; H: Correlation between Rl and R8 in Nanjing. 
doi:l 0.1 371 /journal.pone.0097636.g004 



(Table S4), and ranged from 0.022 to 0.238 with an average of 
0.097 on maturity (R7 or R8, Table S,5). 

The percentages of recessive allele for El, E2, E3 and E4, were 
38.34%, 84.45%, 36.33%, and 7.20%, respectively. Based on 
statistical analysis, interaction effects of E1*E2, E1*E3 and E2*E3 
on flowering time (Rl) were within statistical P values ranging 
from 0.056 to 0.892 (average 0.463), from 0.008 to 0.375 (average 
0.109), and from 0.024 to 0.674 (average 0.336), respectively, 
among different geographic locations. The significant level (P) for 
the interaction effect for E1*E2*E3 was from 0.052 to 0.974 with 
an average of 0.591 (Table S4). 

Similarly, interaction effects on maturity (R7 or R8) for E1*E2, 
E1*E3, E2*E3 were within ranges from 0.262 to 0.962 (average 
0.628), from 0.003 to 0.873 (average 0.500), and from 0.013 to 
0.659 (average 0.284), respectively, among different geographic 
locations. 

3. Eight Genotypic Groups Were Classified 

All the 180 cultivars were classified into eight genotypic groups 
based on the combinations of allelic variations at the four major E 
loci. El, E2, E3, and E4 (Table 3). 

(1) el-nf Group. Cultivars in this group refer to as non- 
functional El gene, consisting of three subtypes: el^s (frame-shift), 
el-nl (nuU type), and el -as {e2, e3, e4) (Tables 1, SI). First, cultivars 
Satamotowase and 9E carry el-fs allele. Second, cultivars, Dongda 
2, Toshidai 7910, Toyosuzu, and Yukihomare, are of el-nl, lacking 
the El gene and its proximate sequence. Third, the genotype of ei- 
as,e2,e3,e4 is included in this group, considering that el -as is 
partially functional and its expression is totally suppressed under 
long day condition under {e2) e3e4 genetic background [19], e.g. 
Fiskeby V, a Sweden photoperiod insensitive cultivar (Table SI). 
The genotypes of several cultivars, e.g. Sakamotowase and 
Toyosuzu, are consistent with the result of Tsubokura et al. 
[29]. Geographically, three Chinese cultivars were coming from 
Heilongjiang, the most northern province of China. In addition, 
one, two, two, five accessions were from Sweden, the USA, 
Canada, and Japan, respectively. 

Average Rl of this genotypic group was 34.35± 10.01 days for 
all the six sites from 2011 to 2012. The longest Rl with an average 
of 49.54 days was observed in Harbin; and the shortest one was 
observed in Nanjing with an average of 20.31 days (Figure 5). 

This group matured very early in all sites when compared with 
the other groups, and the ranges of R7 or R8 in the three northern 
sites were not significantly different from that in Southern sites 
(Figure 6). At the all sites, the averages of R7 and R8 were 
85.83± 14.58 days and 94.95± 16.57 days, respectively. 

(2) el-asE4 Group. Culti\'ars or accessions of this group 
carry tl-as at the El locus with dominant E4 allele, while e2 and e3 
are recessive (Tables 3, SI). Most cultivars in this group are being 
widely used in soybean cultivation in Heilongjiang, the major 
soybean production area in China, e.g. Heinong 44, Kengfeng 17, 
Hefeng 50, and Hefeng 55 (Table SI). Also some landraces from 
the northern regions of China are within this group. Average Rl 
of this genotypic group was 37. 65± 10.04 days for all six sites in 
201 1 and 2012 (Figure 6, Table SI). The longest Rl was observed 
in Harbin with an average of 52.31 days while the shortest Rl was 
observed in Nanjing with an average of 24.10 days. 



(3) el-asE3E4 group. Cultivars with genotypes of el-as, 
recessive e2, and dominant E3 and E4 are classified into this group 
(Tables 3, SI). Cultivars in this group are mainly from Jilin and 
Liaoning provinces, both of which are located a few hundred 
kilometers south of Heilongjiang, in the northeast part of China. 
However, cultivars Hefeng 51 and Suinong 22 are widely used in 
Heilongjiang province (Table SI). Also cultivar Amsoy from the 
USA belongs to this group. 

Average Rl of this genotypic group was 38.86 ±10.47 days for 
all the six sites in 201 1 and 2012. The longest Rl was observed in 
Harbin with an average of 57.06 days while the shortest one was 
observed in Nanjing with an average of 27.88 days. 

(4) el-asE2(E3)E4 Group. Cultivars of this genotype are 
generally composed of two ecological types, el-asE2E4 and el- 
asE2E3E4 (Tables 3, SI). Although nine cultivars are classified into 
this genetic group, their origins are quite different; apart from 
three landraces and two foreign accessions, three cultivars, 
Gaofeng 1, Handou 5 and Sidou 11, were bred geographically 
in the Yellow River regions (Shandong, Hebei and the Northern 
region of Jiangsu province). The landraces, Moshidougong 503 
and Xiaolimushidou are classified into the el-as-E2e3-MoE4 
genotype (Table SI). The cultivar Moshidougong 503 has been 
used for linkage map construction and gene cloning of E2 and E3 
[17,18,19,37]. Cultivars with el-asE2E3E4 genotype were mainly 
coming from Shandong, Hebei or Northern Jiangsu, in down- 
stream of the Yellow river. 

Average Rl of this genotypic group was 46. 13± 14.29 days 
(Figure 5) for the six sites and in 20 1 1 and 20 12. The longest Rl of 
58.56 days was observed in Harbin, and the shortest Rl with an 
average of 31.11 days was observed in Nanjing. 

Average R7 and R8 of this genotypic group was 102. 25± 15.06 
days for all six sites in 2011 and 2012. Average R8 of this 
genotypic group was 1 10. 8± 17.29 days for all six sites in 201 1 and 
2012 (Figure 6). 

(5) El Group. Cultivars of this genetic group are character- 
ized by dominant El and recessive alleles of e2, e3 and e4 (Tables 3, 
SI). Apart from an American cultivar, three cultivars, Kariyutaka, 
130 L and Iwahime, from Japan are of this genotype. No cultivar 
from China was classified into this group in this study. 

Average Rl of this genotypic group was 36. 64± 10.56 days at 
the all six sites in 201 1 and 2012 (Figure 5). The longest Rl was 
observed in Harbin with an a\'cragc of 55.25 days, and the shortest 
one, an average of 21.25 days, was observed in Nanjing 

Average R7 of this genotyjoic group were 92.33± 15.46 days for 
all six sites and in 201 1 and 2012 (Figure 6). While average R8 of 
this genotypic group was 101. 87± 14.25 days for all six sites in 
2011 and 2012. 

(6) E1E4 Group. This genetic group had dominant El and 

E4 alleles but with recessive e2 and e3 alleles (Tables 1, SI). Apart 
from some landraces and foreign cultivars, cultivars of this 
genotype are elite cultivars currendy largely used in Heilongjiang, 
JUin, and Liaoning Provinces. 

Average Rl of this genotypic group was 43.76 ±16.96 days for 
all six sites in 2011 and 2012 (Figure 5). The longest Rl was 
observed in Harbin with an average of 65.81 days, while the 
shortest ones were observed in Nanjing with an average of 27.71 
days. 
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Average R7 of this genotypic group was 98.22 ±13.23 days for 
all six sites in 2011 and 2012 (Figure 6). Average R8 of this 
genotypic group was 108. 20± 16.22 day for all six sites in 2011 
and 2012 (Figure 6). 

(7) E1E3E4 Group. Except for the recessive e2 allele, El, E3 
and E4 alleles were dominant for this genetic group (Tables 3, SI). 
Of all 180 cultivars, 67 cultivars or accessions are classified into 
this group (Table 1), the largest one of eight groups classified in 
this study. The geographic distribution of this group is much 
diversified, from the northern HeUongjiang Province, to southern 
Jiangsu Province (the region along the Yangtze River). 

Average Rl of this genotypic group was 53.21 ± 18.98 days at all 
the six sites in 2011 and 2012 (Figure 5). The longest Rl was 
observed in Harbin with an average of 78.53 days, while the 
shortest one with an average of 3 1 .54 days was observed in Huaian 
(Figure 5). At the maturity stage, average R7 and R8 of this 
genotypic group were 104.22± 16.06 and 1 14. 23± 16.06 days at 
all the six sites in 2011 and 2012, respectively. 

(8) E1E2E3E4 Group. Cultivars of this genetic group have all 
dominant El to E4 alleles, most of which are geographically from 
the southern areas, Jiangsu, Shanghai and Anhui Provinces 
(Tables 3, SI). 

Average Rl of this genotypic group was 61.71 ±22.74 days for 
all the six sites in 2011 and 2012 (Figure 5). The longest Rl was 
observed in Harbin with an average of 90.83 ±40. 16 days. The 
shortest one was observed in Nanjing with an average of 40.16 
days (Figure 5). Average R7 of this genotypic group was 
109.62± 15.17 days for all six sites in 2011 and 2012. Average 
R8 of this genotypic group was 1 15.77 ± 14. 16 days for all six sites 
in 2011 and 2012 (Figure 6). 

Some outiiers (shown as dots in Figures 5 and 6) and 
representative cultivars showing contrasting Rl or R7 or R8 
within the same genetic groups were selected for further 
phenotypic confirmation in 2013. The result showed that 
phenotypic performance in 2013 was consistent with that in 

2011 and 2012 with high correlation coefficients (Table S6). 

4. Comparison of Phenotypic Performance between 
Different Genetic Groups 

We performed statistical comparisons of the time to flowering 
(Rl) and maturity (R7 or R8) among dilferent genetic groups. 

(1) el-nf vs El. The genetic group of el-nf carries either 
nonfunctional type of el or partial functional el -as coupled with 
recessive e2, e3 and e4 background. Flowering time (Rl) or 
maturity (R7 or R8) of the el-nf group were the earhest ones 
among all genetic groups. The El genetic group flowered a little 
later than the e 1 -nf group but not statistically different at all sites 
(Tables S7, S5). The El genetic group was only found in cultivars 
from Japan and the USA, including Kariyutaka, which was used 
for transformation of the El gene for functional confirmation. 
Since recessive e3e4 can suppress the expression of the Elov el -as 
gene, El genotype carrying recessive e2, e3 and e4, is phenotyp- 
ically simflar to the el-nf The subtle differences between the two 
groups might be detected if more cultivars of the El type are used. 

(2) el-asE4 vs E1E4. The average difference of 14.31 days in 
flowering time (Rl) between these two groups evaluated in Harbin 

2012 reached a significant level of P<0.001 (Table S7). At tiie 
same site, the difference in Rl between the two groups in 201 1 was 
also statisticaUy significant (P<0.05). At another northern location 
Gongzhuling in 201 1, the difference in Rl between the two groups 
was also significant at P<0.01 (Table S7). At Mudanjiang, another 
northern site, the difference in Rl was between 6.51 and 8.58 days 
(P<0.05) in the two years (Table S7). For R7 or R8 at aU the six 
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Table 3. Genotypic groups were classified based on tine allelic variations in El, E2, E3 and £4 genes. 





Genetic group 


Genotype at E1,E2L,E3, E4 genes 


No of Cultlvars 


Geographic origin (frequency) 


el-nf 


eT-as,e2,e3-fs,e4-kes 


1 


Heilongjiang(l) 




el-as,e2,e3-tr,e4-SOREl 


1 


Heilongjiang(l) 




el-fs,e2,e3-tr,E4 


2 


Japan(2) 




el-nle2,E3-Ha,e4-SOREl 


2 


USA(2) 




el-nl,e2,E3-Mi,E4 


1 


Japan(l) 




el-nhe2,E3-Mi,e4-SOREl 


1 


Japan(l} 




el-n 1,e2,e3-tr,E4 


2 


Heilongjiang(l}, Canada(l) 




el-nl,e2,e3-tr,e4-SOREl 


3 


Japan(l}, Canada(l}, Sweden{l) 


el-asE4 


el-as,e2,e3-fs,E4 


5 


Heilongjiang(l), Jilin(l), USA(3) 




e1-as,e2,e3-tr,E4 


25 


Heilongjiang(21), Jilin(3), Inner Mongolia(l) 


E1-asE3E4 


el-as,e2,E3-Ha,E4 


10 


Jilin(3), Liaoning(l), Heilongjiang(l), China*(l), USA(4) 




el-as,e2,E3-Mi,E4 


7 


Heilongjiang(4}, Jilin(2}, Liaoning(l) 


el-asE2(E3)E4 


el-as,E2,e3-fs,E4 


1 


Heilongjiang(l} 




el-as,E2,e3-Mo,E4 


2 


Jilin(l), China* (1) 




el-as,E2,E3-Ha,E4 


5 


Jilin{l), Shandongd), Jiangsu{l), USA(2) 




el-as,E2,E3-Mi,E4 


1 


Shandong(l) 


El 


El,e2,e3-tr,e4-SORE1 


3 


Japan(2), USA(l) 




E T ,e2,e3-tr,e4-kam 


1 


Japan(l) 


E1E4{E3) 


El ,e2,e3-fs,E4 


1 


Jilin(l ) 




El,e2,e3-tr,E4 


19 


Heilongjiang(9}, Liaoning(3}, Jiangsu(l), Jilin{1), Japan{3), 
USA(2) 




El,e2,E3-Ha,e4-kes 


1 


Jilin(l) 


E1E3E4 


El,e2,E3-Ha,E4 


20 


Jilin(7), Jiangsu(3), Heilongjiang(2), Shandong(2), 

yinii^nnfU Rpiiinnfll Hpnanfl^ AnhiiiH^ IISAO^ 

AlllJIallU^I/, Dt^MIIIU^I^, nCllall^iy, mi[IUI\iy, 




El,e2,E3-Mi,E4 


47 


Jiangsu(20}, Jilin (12), Liaoning(3), Heilongjiang{2), 
Beijing{2), China* (2), Gansu{1), Henan(1}, Hubei(l), 
Xinjiang(l), Shandong(l), Japan(l), 


E1E2E3E4 


El,E2,E3-Ha,E4 


8 


Jilin(2), Jiangsu(2), Heilongjiang(l), Liaoning(l), 
Shandongd), USA(1) 




El,E2,E3-Mi.E4 


11 


Jiangsu(6), Jilin(l), Heilongjiang(l), Beijing(l), 
Shanghai(l), Japan(l) 


Total 


25 


180 





*No more detailed geographic information available. 
doi:l 0.1 371/journal.pone.0097636.t003 



sites, the maximum difference between tlie two groups was up to 
7.55 days, not statistically different (Table S8). 

Assuming no other genes are involved in, the difference between 
two groups merely reflects the functional difference between El 
and el -as under a genetic background of e2 and e3. The difference 
in Rl between the two groups appeared significantly only in the 
northern sites with longer day in this study. 

(3) el-asE3E4 vs E1E3E4. Rl observed at the north four 
sites between the two groups in 2011 and 2012 were all 
significantly different, while the difference at Huaian and Nanjing 
sites was not significant (Table S7). The difference in R7 between 
the two groups at Huaian in 201 1 was significant (P<0.05); while 
both differences in R7 and R8 in Gongzhuling in 2012 were over 
10 days, significantly different at P<0.01 (Table S8). 

Since the E3E4 background and long day condition can 
promote the expression oi El or el -as, theoretically the functional 
difference between El and el -as can be clearly discerned under 
such genetic and environmental conditions. 

(4) el-asE2(E3)E4 vs E1E2E3E4. At all the sites the 
difference in Rl between two groups was from 8 to 28 days. In 



the most southern site Nanjing, the difference in Rl between the 
two groups in 2011 and 2012 was 8-9 days, although not 
significandy different (Table S7). For all other sites, the difference 
in Rl reached statistical significance (Table S7) in either 20 11, or 
2012, or both years. 

At the most southern site, Nanjing, the difference in R7 between 
the two groups in 201 1 and 2012 was 18-19 days, significandy at 
P<0.001 (Table S8). At another southern site, Huaian, the 
differences for R8 in 201 1 and for R7 in 2012 between two groups 
were significant at P<0.01. The differences in R7 or R8 in the 
most northern sites, Mudanjiang and Gongzhuling did not occur 
at any statistical significant level possibly due to most cultivars in 
the E1E2E3E4 genetic group not reaching the stages of R7 or R8 
before frost. The comparisons between two genotypes indicated 
that the phenotypic difference in Rl between el -as and El under 
E2E3E4 background were significandy larger than the other 
backgrounds. 

(5) El vs E1E4. The differences in Rl at all locations in 201 1 
an2012 between the two groups did not reach statistical 
significance (Table S7). However, the significant difference (P< 
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Figure 5. The phenotypic variations in R1 among different genotypic groups. The phenotypic segregation is shown in box-plot format. The 
interquartile region, median, and range are indicated by the box, the bold horizontal line, and the vertical line, respectively. The 12 panels (A to L) 
represented 6 experimental locations in 2011 and 2012, respectively. A: Harbin in 2011; B: Harbin in 2012; C: IVIudanjiang in 2011; D: IVIudanjiang in 
2012; E: Gongzhuling in 2011; F: Gongzhuling in 2012; G: Jinan in 2011; H: Jinan in 2012; I: Huaian in 2011; J: Huaian in 2012; K: Nanjing in 2011; L: 
Nanjing in 2012. 

doi:l 0.1 371/journal.pone.0097636.g005 



0.05) in R7 and R8 was not detected at any site in 20 II or 2012 
between the two groups, except for R8 at Gongzhuling in 2012 at 
P<0.05 (Table S8). This result indicates that difiFerence in 
functional effect of the El gene under e2e3E4 background rather 
than under the e2e3e4 background is quantitative, but not 
significant. 

(6) E1E4 vs E1E3E4. The differences in RI between the two 
groups at Harbin site both in 20 II and 2012 were between 14.22 
and 16.53 days (P<0.001) (Table S7). Also the differences at 
Gongzhuling in both 201 1 and 2012 were 12.54 and 13.35 days, 
respectively, reaching statistical significance at P<0.001 (Table 
S7). At the all other sites, the differences between two groups were 
between 5 and 13 days, not significantly different (Table S7). 

At maturity stage, the differences in R7 or R8 between two 
groups in Gongzhuling in 2011 and 2012 were significant at 
various level of P<0.05, P<0.01 or P<0.001. The differences in 
R7 in Huaian and Jinan in 201 1 also reached significant level at 
P<0.05 and P<0.001, respectively (Table S8). 

These results indicate that the function of the El gene under 
E3E4 background was more prominent than that under e3E4 
background; however, the difference between two backgrounds 
was reduced in the southern areas with shorter daylength at 
vegetative growing stage. 

(7) E1E4 vs E1E2E3E4. Except for Jinan in 2012, the 
differences in Ri between two groups at all the locations in the two 
years were from 10 to 29 days, reaching statistical significant level 
at P<0.01, mosdy at P<0.001 (Table S7). 

At the later maturity stages, the differences in R7 or R8 between 
the two groups at the three northern locations were not 
significandy different. In contrast, at the three southern locations, 
the differences in R7 or R8 between two groups were larger, 
mostly at P<0.001 (Table S8). For example, the differences in R7 
between two groups at Nanjing were fromi 7.82 to 18.54 days (P< 
0.001, Table S8). In Mudanjiang and Gongzhuling, most cultivars 
in the E1E2E3E4 genetic group could not mature before frost. 

(8) E1E3E4 vs E1E2E3E4. Large differences of 15-21 days 
in RI between the two groups were observed in Mudanjiang in 
201 1 and 2012 (P<0.001) (Table S7). In die most soudiern 
location Nanjing, no significant difference (6-7 days) in RI 
between two groups was detected. In other sites, e.g. Harbin, 
Gongzhuling, Jinan, and Huaian, the differences in RI between 
the two groups in 201 1 and 2012 reached significant levels at P< 
0.05, P<0.01 or P<0.001 (Table S7). 

At the later maturity stage, the difference in R7 and R8 at 
Huaian site in 2012 reached significance (P<0.01 or P<0.001) 
(Table S8). Similarly, in Nanjing, the differences in R7 in 2011 
and 2012 reached significance l(-v(;l at P<0.001 level (Table S8). 

The difference between two groups reflects the functions of the 
E2 gene, which might not strongly be associated with photoperiod 
response [18]. 

5. Transcript Abundance of a Functional El Allele Was 
Significantly Related to Flowering Time and Maturity 

Due to the prominent effect of the El gene on flowering time, 
we analyzed the correlation between expression levels of functional 
El gene and the flowering time. In the northern location of 
Harbin, we sampled the leaves of cultivars carrying the El allele or 



el-as allele within 2.5 to 3.0 hr after dawn at seedling stage (14 
days after emergence) on May 20 (the day length was 16.10 h). 
The exact zeitgeber time (after dawn) for sampling was considered 
the first peak of a bimodal diurnal pattern for El expression 
appears around 2-4 hr after dawn ([19], Abe et al, unpublished 
data). We analyzed the correlation between RI and the expression 
levels of the El (el-as) gene, and a significant correlation was 
detected for the El gene (r = 0.8600**, P<0.01) and for die el-as 
gene (R = 0.7293**, P<0.01) (Figure 7). This result is consistent 
with that our previous work where the transcriptional level of the 
El gene in transgenic soybean was significantly related to the 
flowering phenotype [19]. Also we can conclude that el-as is also a 
functional allele at the El locus, though its function in repressing 
flowering is not as strong as the El gene. 

Discussion 

Each soybean cultivar generally has a specific geographic or 
latitudinal distribution for its high yield. In soybean production, 
the northern cultivars with high photoperiod insensitivity flower 
very early when grown in the southern area, which limits biomass 
accumulation before flowering, and results a lower yield. On the 
other hand, the southern cultivars grown in the northern area 
might not be able to mature before frost. In Northern America, 
cultivars are classified into thirteen maturity groups l)ascd on the 
flowering time and maturity data. In China, researchers have also 
developed a system for classification of cultivars based on flowering 
time and maturity [38]. Some researchers have tried to 
synchronize different maturity group systems used in the USA 
and in China [39]. Understanding of the molecular basis or 
mechanisms involved in controlling maturing is very important for 
marker assisted selection (MAS) and for breeding by molecular 
design in the future. 

1. The Magnitude of Effects of Four E Genes and Their 
Combinations on Flowering Time and Maturity 

In this study, cultivars of the el-nf group either having 
nonfunctional allele or putatively suppressed expression level of 
the el-as gene displayed a very early flowering time phenotype, 
which confirmed that the El is the most important locus for this 
trait. Similar to that of the El gene, transcript abundance oi el-as 
gene was significandy correlated with flowering time, indicating 
el-as is also a functional allele. Previous studies demonstrated that 
functional pathways ot E3 and E4 are different but overlapping, at 
least partially through regulation El expression [19,40]. Also, 
the effects of E3 and E4 revealed in this study are consistent with 
the results obtained in the previous genetic studies using Harosoy 
or Clark isogenic fines [20,26]. The percentages of recessive alleles 
for the El, E2, E3 and E4 loci were 38.34%, 84.45%, 36.33%, and 
7.20%, respectively. This result indicates that dominant E4 and 
recessive e2 are the most common genetic makeups among 
Chinese cultivars, whereas the El and E3 loci have been 
undergone a high selection pressure in breeding in China. 

2. Genes Other Than the Four Major El Genes Exist 

Among the known loci controlling flowering and maturity, E5 to 
E8, and locus J have not been cloned. Recentiy, in some specific 
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Figure 6. The phenotypic variations in R7 or R8 among different genotypic groups. The phenotypic segregation is shown in box-plot 
format. The interquartile region, median, and range are indicated by the box, the bold horizontal line, and the vertical line, respectively. A: R8 at 
Mudanjiang in 201 1 ; B: R8 at Mudanjiang in 201 2; C: R7 at Gongzhuling in 201 1 ; D: R7 at Gongzhuling in 201 2; E: R7 at Huaian in 201 1 ; F: R7 at Huaian 
in 2012; G: R7 at Nanjing in 2011; H: R7 at Nanjing in 2012. 
doi:1 0.1 371/journal.pone.0097636.g006 



genotype backgrounds, some Arabidopsis flowering homologs were 
identified to be responsible for flowering time by QTL using 
mapping although further functional confirmation is needed 
(Watanabe et al, unpublished data). Since we have not sequenced 
the coding region or promoter region oiEl to E4 loci, there might 
be some structural or indel or SNP variations not yet detected 
[29]. As indicated in this study as well as the previous genetic 
study, some new genes are involved in control of flowering time, at 
least partially through regulation of the expression level of the El 
{el -as) gene. 

3. Similar Phenotypic Performance for Flowering Time 
Can Be Achieved with Different Allelic Combination at 
the El to E4 loci 

As revealed in this and previous studies [29,30], there are a 
large number of allelic variations among the coding regions and 
promoter regions. Additionally, there are other known (e.g. E5 to 
E8, J) or unknown genetic factors that are involved in the control 
of flowering time and maturity. More than one genetic groups 
identified in this study displayed similar or overlapping range of 
phenotypic performance among the cultivars. Theoretically, the 
improvement of a given trait can be achieved by various genetic 
combinations in breeding. On the other hand, if two cultivars 
displaying similar phenotype can have different genotypes, 
transgressive inheritance may occur in a cross between two such 
cultivars. 

4. Future Prospective and Conclusion 

Flowering time and maturity affect yield directly. Appropriate 
flowering time or maturity period are fundamentally required for 
soybean cultivars grown in different geographic regions. In Brazil, 
the introduce of the J locus, controlling the long juvenile trait, 
allowed the expansion of the soybean production area near north. 



Secondly, the E genes have effects on branching and growth habit. 
In 1930s, the breeding pioneer Jinling Wang already developed 
the most precocious cultivars, extending the frontier of soybean 
production zone further to the north by several hundred 
Kilometers in China [41]. Now we gradually understand that 
el-nf genetic groups are approximately corresponding to cultivars 
of MG 000, MG 00, or MG 0 groups, since two cultivars from 
Canada and the cultivars from the most northern soybean 
production area, HeUongjiang, China belong to this el-nf group. 

According to a previous study, the El gene also plays an 
important role in seed yield, plant height lodging and seed 
composition [26], since it can control TFLl expression [30]. On 
the other hand, the function of El might also be related to 
branching trait, as indicated by a QTL mapping study [27]. In 
Canada, Cober and Morrison tried to use the Ele3e4dtl genotypic 
combination to elevate the position of the first pod [26] . 

One of the important goals for breeding is to improve yield of 
soybean cultivars. Considering that the flowering time and 
maturity are the simple but important traits affecting geographic 
adaption and yield [26], a full understanding of their precise 
regulation mechanism will enable us to breed cultivars with the 
appropriate flowering or maturity characters by design, possibly 
serve as a model for other more complicated traits e.g. yield and 
quality in soybean. 

Supporting Information 

Table SI Generally information and phenotypic data of 
180 accessions used in this study. 

(XLSX) 

Table S2 Primers used for genotyping in this study. 

(XLSX) 




Figure 7. Correlation coefficients between the transcript abundance of the El (A) or e7-as (B) genes and flowering time of cultivars 
grown at Harbin, in 2012. 

doi:1 0.1 371/journal.pone.0097636.g007 
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Table S3 Flowering time (Rl) and maturity (R7 or R8) 
data in different geographic locations in 2011 and 2012. 

(XLSX) 

Table S4 Multivariate analysis of allelic variations at 
El, E2, E3, and E4 and their combinations on flowering 
time (Rl) using IBM SPSS Statistics 17.0 (www-Ol.ibm.- 
com/software/analytics/spss) based on Generalized 
Linear Models (GML) and Type III Sum of Squares. 
PCLSX) 

Table S5 Multivariate analysis of aUelic variations at 
El, E2, E3, and E4 and their combination on maturity 
(R7 or R8) using IBM SPSS Statistics 17.0 (wvrw- 
01.ibm.com/sofitware/analytics/spss) based on Gener- 
alized Linear Models and Type III Sum of Squares. 
PCLSX) 

Table S6 The correlation analysis of flowering time 
(Rl) and R7 or R8 for about 50 cultivars (accessions) 
between 2013 and 2011/2012 at four experimental 
locations. 

Table S7 Two-way ANOVA analysis with Bonferroni 
posttests' test for flowering time (Rl) of different genetic 

References 

1. Stupar R M, Spccht JE (2013) Insights from the soybean {Glycine max and Glycine 
soja) genome: past, present, and future. Adv Agron 118:177-204. 

2. Hartwig EE (1973) Varietal development. In: Caldwell BE, Editor. Soybeans: 
lm];)rovement, Production and Uses. Madison: ASA, CSSA, and SSSA.pp. 187- 

210. 

3. Norman A(} (1978) Soybean Physiology, agronomy and utilization. N Y 
Academic Press. 249 p. 

4. Cober ER, Voldeng HD (2001) A new soybean maturity and photoperiod- 
sensitive locus linked to El and T. Crop Sci 41: 698-701. 

5. Zhang LX, Kyei-Boahen S, Zhang J, Zhang MH, Freeland TB, et al. (2007) 
Modifications of optimum adaptation zones for soybean maturity groups in the 
USA. Crop Management. Crop Manage 6: 1, doi:10.1094/CM-2007-0927-01- 
RS 

6. Carner WW. Mlard HA (1920) Effect of the relative length of day and night and 
other factors of the environment on growth and reproduction in plants. J Agric 

Res 18:553 GOG. 

7. Bernard RL (1971) l\vo major genes for time of flowering and maturity in 
soybeans. Crop Sci 11: 242-244. 

8. Buzzell Rl (1971) Inheritance of a soybean flowering response to fluorescent- 
daylength conditions. Can J Genet Cytol 13: 703-707. 

9. Buzzell Rl, Voldeng HD (1980) Inheritance of insensitivity to long day length. 
Soybean Genet Newsl 7: 26-29. 

10. MeBlain BA, Bernard RL (1987) A new gene affecting the time of flowering and 
maturity in soybean. J Hered 78: 1G0-1G2. 

1 1. Bonato ER, Velio NA (1999) E6, a dominant gene conditioning early flowering 
and maturity in soybeans, (ienet Mol Biol 22: 229—232. 

12. Cober ER, Molnar SJ, Charette M, Voldeng HD (2010) A new locus for early 
maturity in soybean. Crop Sei 50: 524-527. 

13. RayJD, Hinson K, Mankono EB, Malo EM (1995) Genetic control of a long- 
juvemle trait in soybean. Crop Sci 35: 1001-lOOG. 

14. Kumudini SV, Pallikonda PK, Steele C (2007) Photoperiod and e-genes 
influence the duration of the reproductive phase in soybean. Crop Sci 47: 1510- 
1517. 

15. Xia Z, Zhai H, Lii S, Wu H, Zhang Y (2013) Recent achievement in gene 
cloning and functional genomics in soybean. ScientificWorldJournal, 
2013:281367. doi: 10.1155/2013/281367 

16. Liu B, Kanazawa A, Matsumura H, 'I'akahashi R, et al. (2008) (ienetie 
redundancy in soybean photorcsponses associated with duplication of the 
phytochrome A gene. Cienrrics 180: 995-1007. 

17. Watanabc S, Hidcshima R, Xia Z, Tsubokura Y, Sato S, et al. (2009) Map- 
based cloning of the gene associated with the soybean maturity locus E3. 
Genetics 182:1251-1262. 

18. Watanabe S, Xia Z, Hideshima R, Tsubokura Y, Sato S, et al. (201 1) A map- 
based cloning strategy employing a residual heterozygous line reveals that the 
GIGANTEA gene is involved in soybean maturity and flowering. Genetics 188: 
395-407. 

19. Xia ZJ, Watanabe S, Yamada T, Tsubokura S, Nakashima H, et al. (2012) 
Positional cloning and characterization revesd the molecular basis for soybean 



groups among difFerent locations in 2011 and 2012 using 
GraphPad Prism version 5.00. 

(XLSX) 

Table S8 Two-way ANOVA analysis with Bonferroni 
posttests' test for maturity (R7 or R8) data of different 
genetic groups among different locations in 2011 and 
2012 using GraphPad Prism version 5.00. 

(XLSX) 

Acknowledgments 

We would like to thank Y Tsukubora, S Watanabc, K Harada from 
National Institute of Agrobiological Sciences, Japan for proving the DNA 
marker information, Shuming Wang from Jilin Academy of Agricullural 
Science, for providing most accessions used in this study, W' Fu from 
GeoData Service Center of Northeast, Northeast Institute of Cieography 
and Agroecology, for collecting the weather data, Professor Cober ER from 
Agric. & Agri-Food Canada, Eastern Cereal and Oilseed Research Centre 
for language editing. 

Author Contributions 

Conceived and designed the experiments; ZX HZ SL. Performed the 
experiments: HZ SL JY WC CZ HG HQ,XZ TC. Analyzed the data: ZX 
HZ SL. Contributed reagcnts/materials/analysis tools: YW XC HR. 
Wrote the paper: ZX SL HZ. 



maturity locus El, which regulates photoperiodic flowering. Proc Natl Acad Sci 
USA 109: E2 155-2 164. 

20. Cober ER, Tanner JW, Voldeng HD (1996) Soybean photoperiod-sensitivity 
loci respond differentially to light quality. Clrop Sci 36: 606-610. 

21. Abe J, Xu DH, Miyano A, Komatsu K, Kanazawa K, ct al. (2003) Photoperiod- 
insensitive Japanese soybcEin land- races differ at two maturity loci. Crop Sci 43: 
1300-1304. 

22. Sawa M, Nusinow DA, Kay SA, Imaizumi T (2007) FKFl and GIGANTEA 
complex formation is required for day-length measurement in Arabidopsis. 
Science 318: 261-265. 

23. Sawa M, Kay SA (2011) GIGANTEA directly activates Elowering Locus Tin 
Arahidofms thaliami. Proc Nad Acad Sci USA 108: 1 1698-1 1703. 

24. Gwen FV (1927) Inheritance studies in soybeans. 11. (ilabrousness, color of 
pubescence, time of maturity, and linkage relations. Geneticsl2:519— 529. 

25. Schmutz J, Cannon SB, Schlucter J, Ma J, Mitros T, et al. (2010) Genome 
sequence of the palacopolyploid soybean. Nature 463: 178—183. 

26. Cober ER, Morrison MJ {2010} Regulation of seed yield and agronomic 
characters by photoperiod sensitivity and growth habit genes in soybean. Theor 
Appl Genet 120:1005-1012. 

27. Sayama T, Hwang TY, Yamazaki H, Yamaguchi N, Komatsu K, et al. (2010) 
Mapping and comparison of quantitative trait loci for soybean branching 
phenotvpe in u\o locations. Breed Sci 60:380-389. 

28. Funatsuki H, Kawaguchi K, Matsuba S, Sato Y, Ishimoto M (2005) Mapping of 
QTL associated with chilling tolerance during reproductive growth in soybean. 
Theor Appl Genet 111: 851-861 

29. Tsubokura Y, Watanabe S, Xia Z, Kanamori H, Yamagata H, et al. (2014) 
Natural variation in the genes responsible for maturity loci £7, E3 and E4 in 
soybean. Ann Bot 113: 429-441 

30. Xu M, Xu Z, Liu B, Kong fj, Tsubokura Y, et al. (2013) Genetic variation in 
four maturity genes affects photoperiod insensitivity and PHYA-regulated post- 
flowering responses of soybean. BMC Plant Biology 13: 91—104. 

31. Murray MG, Thompson WF. (1980). Rapid isolation of high-molecular-weight 
plant DNA. Nucleic Acids Res 8: 4321-4325. 

32. Kawasaki S, Murakami Y (2000) Genome analysis of Lotus japonicm. ] Plant Res 
113: 497-506. 

33. Xia Z, Tsubokura Y, Hoshi M, Hanawa M, Yano C, et al. (2007). An integrated 
high-density linkage map of soybean with RFLP, SSR, STS, and AFLP markers 
using A single F2 population.DNA Res 14(6):257-69. doi: 10.1093/dnares/ 

dsm027. 

34. Fehr WR, Caviness CE, Burmood DT, Pennington JS (1971) Stage of 
development descriptions for soybeans. Glycine max (L.) Merrill. Crop Sci 
11:929-931. 

35. Willems E, Leyns L, Vandesompele J (2008) Standardization of real-time PGR 
gene expression data from independent biological replicates. Anal Biochem 379: 
127-129. 

36. Shin JH, Lee SH (2012). Molecular markers for the E2 and E3 genes controlling 
flowering and maturity in soybean. Mol Breed 30: 1793-1798. 

37. Yamanaka N, Watanabe S, Toda K, Hayashi M, Fuchigami H, et al. (2005) Fine 
mapping of the FTl locus for soybean flowering time using a residual 



PLOS ONE I www.plosone.org 



15 



May 2014 I Volume 9 | Issue 5 | e97636 



Genotypes and Expression of E Genes in Soybean 



heterozygous line derived from a recombinant inbred line. Theor Appl Genet 
110: 634-639. 

38. Gai JY, Wang YS, Zhang MC, Wang JA, Chang RZ (2001) Studies on the 
classification of maturity groups of soybeans in China. Acta Agronomica Sinica 
27: 286-292. 

39. Wang YS, Qin JB, Gai JY, He GG (2006) Classification and characteristic of 
maturity groups of Chinese landraces of soybean [Glycine max (L.) Merr.] Genet 
Resour and Crop Evo 53: 803-809. 



40. Kong 1% Liu B, Xia Z, Sato S, Kim BM, ct al. (2010) Two Coordinatcly 
regulated homologs of J'LO VVERLNG LOCUS T arc involved in the control of 
photoperiodic flowering in soybean. Plant Physiol 154: 1220—1231. 

41. WangJL (1961) Soybean ecotypes and soybean cultivation and breeding. Sci 
Agric Sin, 2(1): 24-27 (in Chinese). 



PLOS ONE I www.plosone.org 



16 



May 2014 I Volume 9 | Issue 5 | e97636 



